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Abstract 

We study the nonparametric estimation of the jump density of a re- 
newal reward process from one discretely observed sample path over [0, T]. 
We consider the regime when the sampling rate A = A^ — > as T — > oo. 
The main difficulty is that a renewal reward process is not a Levy process: 
the increments are non stationary and dependent. We propose an adaptive 
wavelet threshold density estimator and study its performance for the L p 
loss, p > 1, over Besov spaces. We achieve minimax rates of convergence 
for sampling rates At that vanish with T at polynomial rate. In the same 
spirit as Buchmann and Grubel (2003) and Duval (2012), the estimation 
procedure is based on the inversion of the compounding operator. The 
inverse has no closed form expression and is approached with a fixed point 
technique. 
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1 Introduction 

1.1 Motivation and statistical setting 

Renewal reward processes are pure jump processes used in many application 
fields, for instance in seismology (see Alvarez [1] or Helmstetter et al. [15]), to 
model rainfall (see Rodriguez-Iturbe et al. [24]) or in mathematical insurance 
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and finance (see for instance Scalas et al. [26, 27] or Masolivier et al. [20]). If 
many papers are devoted to the estimation of a discretely observed Levy process 
(see for instance Bee and Lacour [2], Comte and Genon-Catalot [5, 7], Figueroa- 
Lopez [13] and Duval [11] for the high frequency case and Neumann and Reifi[23] 
and Comte and Genon-Catalot [6] for the low frequency one), to the knowledge 
of the author, little exists on the estimation of a discretely observed renewal 
reward process. Vardi [29] estimates the density of a renewal process without 
rewards from the continuous observation of several independent trajectories. In 
this paper we estimate the compound law of a renewal reward process when 
one trajectory is observed at a sampling rate that goes to arbitrarily slowly. 

Let J\ , . . . , Ji be nonnegative independent random variables where J2, . . . , J% 
are identically distributed. Define the time of the ith jump as T{ = J\+...+Ji, 
i > 1. The associated counting process or renewal process R is 

00 

R t = J2M<t, t>o. 

i=l 

The Poisson process is a particular case of a renewal process, corresponding 
to exponentially distributed interarrivals (Jj). That latter case excepted, R 
does not have independent increments and is usually not stationary i.e. for 
all positive t, h the law of Rt+h ~ Rt depends on t. Assume that the common 
distribution r of the (</«) has finite expectation 

("OO 

fi = / tr(dt) < 00, 
^0 

define the distribution 

l-J X r(dt) 

T Q {X) = ^ . (1) 

The process R is stationary if and only if J\ has distribution To (see Lindvall 
[19] p. 70). Define the renewal reward process X as 

Rt 
1=1 

where the (£j) are independent and identically distributed random variables, 
independent of the interarrivals (JA . Renewal reward processes also correspond 
to decoupled continuous time random walks. 
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Assume that we have discrete observations of the process A over [0, T] at 
times iA for some A > 

(A A , . . . ,X|_ TA -ija)- ( 2 ) 
We focus on the microscopic regime namely 

A = A T -)■ as T oo, 
and work under the following assumption. 

Assumption 1. The law of the £i has density f which is absolutely continuous 
with respect to the Lebesgue measure. 

The law of the Ji, i > 2 has density r which is absolutely continuous with respect 
to the Lebesgue measure and J\ has density tq. 

The necessity of the last part of Assumption 1 is discussed in Section 5. 

We denote by ^(K) the space of densities with respect to the Lebesgue 
measure supported by M.. We investigate the nonparametric estimation of the 
density / on a compact interval P of M from the observations (2). To that end 
we use wavelet threshold density estimators and study their rate of convergence, 
uniformly over Besov balls, for the following loss function 

m\f-f\\ p L P <p ) ]) 1/p > (3) 

where / is an estimator of /, p > 1 and \\.\\ L ^) denotes L p loss over the 
compact set T>. We do not assume the interarrival distribution r to be known: 
it is a nuisance parameter. 

We estimate / from the increments of X, which are dependent. By As- 
sumption 1, on the event {A^a — A(j_i)A = 0} no jump occurred between 
{i — 1)A and iA so that the increment AjA — -^U—i)A gives no information on 
/. In the microscopic regime A = A^r —> many increments are zero, therefore 
to estimate / we focus on the nonzero increments. We denote by their 
number over [0,T]. In that statistical context different difficulties arise; the 
number of data Nt used for the estimation is random, the increments are de- 
pendent, but more importantly on the event {AjA — A^^a 7^ 0}, the density 
of AjA — A(j_!)A is not /. Indeed even if A is small there is always a positive 
probability that more than one jump occurred between {i — 1)A and iA. Con- 
ditional on {AjA — A(j_x)A 7^ 0}, the law of A^a — A^^a has density given 
by (see Proposition 1 below) 

oo 

PA [/](*) = Y, ¥ ( Ra = m \ R ^ /0)r m (x), forxGR, (4) 

771=1 
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where * is the convolution product and f* m = /★...*/, m times. Hereafter 
Lemma 1 gives for A small enough 

1 -2r(0)A < F(R A = m\R A + 0) < 1. (5) 

We deduce from (5) the decomposition 

P A [/] = / + r(A), 

where r(A) is a deterministic remainder of the order of A. We will see in 
Theorem 1 that if A = A^ goes to fast enough, namely TA T = O(l) (up 
to logarithmic factor in T) r(A) is negligible and it is possible to estimate / 
with optimal rates by ignoring the remainder r(A). Otherwise, when there 
exists < 5 < 1 such that TA T = 0(T' 5 ) (up to logarithmic factors in T) 
the remainder r(A) is no longer negligible. The condition 8 < 1 ensures that 
At goes to as T tends to infinity. In the sequel we distinguish two different 
regimes that will be treated separately. 

• Fast microscopic rates when -up to logarithmic factors in T— 

TA T = 0(1). 

• Slow microscopic rates when there exists < 5 < 1 such that -up to 
logarithmic factors in T— 

TA T = 0(T 5 ). 

Since all the results of the paper are given up to logarithmic factors in T, fast 
and slow microscopic rates cover all vanishing behaviours for A = At- We 
try to answer the following question: Is it possible to construct an adaptive 
wavelet estimator of / in fast and slow microscopic regimes which is optimal? 
Papers which estimate nonparametrically the Levy measure from a discretely 
observed Levy process attain optimal rate estimators only for fast microscopic 
rates (see for instance Bee and Lacour [2], Comte and Genon-Catalot [5, 6, 7] 
and Figueroa-Lopez [13]). 

1.2 Our Results 

In Section 2 we estimate / in the fast microscopic regime, the estimation pro- 
cedure is based on the approximation 

/~Pa[/]- 
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We construct an adaptive wavelet threshold density estimator from the obser- 
vations (2). It achieves the minimax rate of convergence which is T~ a(s ' p,7r ) if / 
is of regularity s measured with the L n norm, ir > 0, and where a(s, n,p) < 1/2 
(see (16) hereafter). That procedure does not depend on the interarrival density 
r apart from Assumption 1. Moreover the estimator does not explicitly depend 
on the random quantity Nt, the number of nonzero increment. 

In Section 3 we estimate / in the slow microscopic regime, the estimation 
procedure is the analogue of the one used in Duval [11]. The starting point is 
that 

/ = P A 1 [P A [/]], 

and we proceed in two steps to estimate /. The first step is the computation 
of the inverse of the operator Pa defined in (4). That step can be referred 
as decoumpounding as introduced in Buchmann and Griibel [3] or van Es et 
al. [28]. That inverse cannot be explicitly calculated, contrary to [11], but can 
be approached using a fixed point method. Indeed / is a fixed point of the 
operator 

H Aif :h^P A [f} + h-P A [h] 

which is a contraction if h and / verifies suitable smoothness properties (see 
Proposition 2 below). The Banach fixed point theorem guarantees that for K 
in N and p > 1, 

||h a *[p a [/]]-/|| 

is small in a sense that we precise later. Next we observe that the Taylor 
expansion of order K in A of H^[Pa[/]] takes the form 

K+l 

^/ m (A)P A [/H, (6) 

771=1 

where the (Z m (A)J depend on the unknown interarrival density r (see Propo- 
sition 1 below). If t is described by an unknown parameter •& € R then 
£ m (A) = Z m (A, ■&) is estimated by plugging an estimator of 

The second step consists in estimating the densities Pa[/]*"\ for m = 
1 , . . . , K + 1 . For that we focus on the Nt nonzero increments which have 
density Pa[/]- The difficulty here is that we have Nt dependent observations 
where Nt is a random sum of dependent variables. The dependency of the 
increments is treated using that at each renewal times the renewal process 
forgets its past. To cope with the randomness of Nt, we prove that Nt/T 
concentrates for T large enough around a deterministic limit using Bernstein 
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type inequalities for dependent data (see Lemma 5 in Section 6 and Dedecker et. 
al. [9]). In Theorem 2 we show that wavelet threshold estimators of Pa [/]*"* 
attain a rate of convergence -up to logarithmic factors- in T~ a ^ s ' 7T ' p ' 1 . We inject 
those estimators into (6) and obtain an estimator of / that we call estimator 
corrected at order K . 

The study of the rate of convergence of the estimator corrected at order K 
requires to control two distinct error terms. A deterministic one due the first 
step which is the error made when approximating / by (6). And a statistical 
one due to the replacement of the PaL/]*" 1 by estimators in the second step. 
The deterministic error decreases when K increases. We choose K sufficiently 
large for the deterministic error term to be negligible in front of the statistical 
one. We give in Theorem 2 an upper bound for the rate of convergence of the 
estimator corrected at order K which is in -up to logarithmic factors- 

max{T- Q ( s '^,A^ +1 }. 

Since a(s, ir,p) < 1/2 if there exists Kq such that 

TA 2K +2 < 

the estimator corrected at order Kq attains the optimal rate. 

Remark 1. There is a slight difference of methodology between fast and slow 
microscopic rates to estimate f; for fast rates we estimate f using all the incre- 
ments but in slow rates we focus on nonzero ones. In that latter case, building 
an estimator using all the increments, even zero ones, achieving the rates of 
Theorem 2 is possible but numerically unstable. And a technical constraint in 
the proof of the concentration of Nt/T prevented us from having a unified pro- 
cedure for fast and slow microscopic rates. 



The paper is organised as follows. In Section 2 we give an adaptive minimax 
estimator of / in the fast microscopic regime. In that Section we also define 
wavelet functions and Besov spaces that are used for the estimation and describe 
the law of the increments. Those results are also used in Section 3 where we 
give an adaptive minimax estimator of / in the slow microscopic regime. In 
both Sections 2 and 3 we give upper bounds for the rate of convergence of the 
estimator of / for the L p loss defined in (3), p > 1, uniformly over Besov balls. 
In Section 4, a numerical example illustrates the behavior of the estimators of 
/ introduced in Sections 2 and 3. Finally Section 6 is dedicated to the proofs. 
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2 Estimation of / in the fast microscopic regime 



2.1 Preliminary on Besov spaces and wavelet thresholding 

For the estimation, we use wavelet threshold density estimators and study their 
performance uniformly over Besov balls. In this paragraph we reproduce some 
classical results on Besov spaces, wavelet bases and wavelet threshold estimators 
(see Cohen [4], Donoho et al. [10] or Kerkyacharian and Picard [17]) that we 
use in the next sections. 



Wavelets and Besov spaces 

We describe the smoothness of a function with Besov spaces on T>. We recall 
here some well documented results on Besov spaces and their connection to 
wavelet bases (see Cohen [4], Donoho et al. [10] or Kerkyacharian and Picard 
[17]). Let (V ; a) a be a regular wavelet basis adapted to the domain V. The 
multi-index A concatenates the spatial index and the resolution level j = |A|. 
Set Aj := {A, |A| = j} and A = Uj>_iAj, for / in L p (R) we have 

j>-l AgAj 

where j = — 1 incorporates the low frequency part of the decomposition and 
(., ) denotes the usual L2 inner product. For s > and tt £ (0, 00] a function / 
belongs to the Besov space B^^iV) if the norm 

SfooCP) := WfW^iv) + \\f {n) \\L^(v) + ' - r (8) 



t a 



Loc (V) 



is finite, where s = n + a, nEN and a £ (0, 1], w is the modulus of continuity 
defined by 

^ (/ ,t) = sup ||D /l D /l [/]|| L 

l/l<* A ' 

and Y) h [f](x) = f(x — h) — f(x). Equivalently we can define Besov space in 
term of wavelet coefficients (see Hardle et. al. [14] p. 123), / belongs to the 
Besov space £>£ 00 ('D) if the quantity 

sup2^+v^)(£ K/ ,v, A >r) 1A 

is finite, with usual modifications if tt = 00. 
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We need additional properties on the wavelet basis (ipx) x , which are listed 
in the following assumption. 

Assumption 2. For p > 1, 



• We have for some £ > 1 

£-l 2 |A|(p/2-l) < 



For some <t > 0, a > and for all s < a, J > 0, we have 

11/ - E E vmHw ^ ^- Js wn\B ioB ipy 

j<j xeAj 

If P > 1, /or some C > 1 anc? /or any sequence of coefficients (u\ 

1/2 



(9) 



AeA' 



C" 1 J]ti A V; 

AeA 



< 



L P (D) 



E I u ^aI 

AeA 



L P (D) 



< d E Ua ^ ; 



AeA 



(10) 



For any subset Aq C A and for some €. > 1 



AeAo ,/:D AeA AeAo 



;a IIl p (©)- 



;n) 



Property (9) ensures that definition (8) of Besov spaces matches the defi- 
nition in terms of linear approximation. Property (10) ensures that is 
an unconditional basis of L p and (11) is a super-concentration inequality (see 
Kerkyacharian and Picard [17] p. 304 and p. 306). 



Wavelet threshold estimator 

Let (<j), tjj) be a pair of scaling function and mother wavelet that generate a basis 
{ip\) x satisfying Assumption 2 for some a > 0. We rewrite (7) 

/ = E a 0fc^0fc + E E fyk^jk, 
fceAo j>i feeAj 

where (pok(') = </>{* — k) and ipjk( m ) = 2^ 2 il){2 : > • — fc) and 

"ofc = y <fiok(x)f(x)dx 
Pjk = / ipjk{x)f(x)dx. 
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For every j > 0, the set Aj has cardinality 2 J and incorporates boundary terms 
that we choose not to distinguish in the notation for simplicity. An estimator 
of a function / is obtained when replacing the (aofc) an d (Pjk) by estimated 
values. In the sequel we uses ('Jjk) to design either (aofc) or (Pjk) an d (<7jjfc) for 
the wavelet functions ((fiok) or W'jfc)- 

We consider classical hard threshold estimators of the form 

J 

/(•) = Yl s ofc^ofc(») + ^ ^^{IM^}^^*)' 

fceA i=i fceAj 

where So^ and are estimators of aofc and /3jfc, J and 77 are respectively the 
resolution level and the threshold, possibly depending on the data. Thus to 
construct / we have to specify estimators (ffjk) of the (jjk) and the coefficients 
J and rj. 

2.2 Construction of the estimator 

Assume that we have [^A -1 ] discrete data at times iA for some A > of the 
process X 

(X A , ■ ■ • , ^iTA-iJA)- 

Introduce the increments 

V A Xi = X iA - X {i _ 1)A , for i = 1, . . . , LTA- 1 ] , 

where Xq = 0. By Assumption 1, they are identically distributed but not 
independent. 

Proposition 1. The distribution of the increment ~D A X\ is 

(l-p(A))* +p(A)P A [/] 

where 5q is the dirac delta function, p(A) = ¥(R A 7^ 0) and 

00 

Pa[/]= J2p m (A)f m , (12) 

m=l 

where * is the convolution product, f* m is f convoluted m times and 

Pm (A) =¥(R A = m\R A ^0). 
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It is straightforward to verify that the operator Pa is a mapping from .F0R) 
to itself. The following Lemma gives a polynomial control of the coefficients 
(p m (A)). It is widely used in Sections 2 and 3 and does not depend on the rate 
at which At decays to 0. 

Lemma 1. Assume r(0) > and let Aq be such that 

r(t)dt < - and sup r(t) < 2r(0). 
o 2 te[o,Ao] 

For all A < Ao we have 

1 - 2r(0)A < pi(A) < 1, 

and for m > 2 

0<p m (A)<2 (2T(0) f 
ml 

where the (p m (A)) are defined in Proposition 1. 

Remark 2. The assumption r(0) > in Lemma 1 ensures that the given 
inequalities are sharp. In the Poisson case it is always true since r(0) is the 
positive intensity. In the renewal case we may have r(0) = ; if so two cases 
must be distinguished. The first one is when r as infinitely many derivatives 
null at 0; it is the case if r is bounded away from 0. Then straightforward 
computations give for any K in N: pi(A) = 1 + O(A^), thus the procedure of 
Section 2 enables to achieve optimal rates even in slow microscopic regimes. It 
is not the purpose of this paper. The second case is r(0) = but there exists 
Iq in N such that r^°)(0) > 0, then Lemma 1 can be adapted replacing r(0) by 
t^°'(0) and A by A' . In the sequel we assume that r(0) > and leave to the 
reader the changes to be made when r(0) = 0. 

In this Section we consider the regimes for which A = At is such that 
TA T = O(l), up to logarithmic factors in T. To estimate /, we use the approx- 
imation Pa t [/] ~ /• It is equivalent to consider that nonzero increments are 
realisations of /. We construct wavelet threshold density estimators of Pa[/] 
from the observations 

(B A X l ,i = l,...,[TA- 1 \). 
Define the wavelet coefficients 

%fc= (l-p(A))LTA-ij 9jk \ B X V 1 {d^o}' ( 13 ) 
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where p(A) is defined in Proposition 1. Let r/ > and J 6 N\{0}, the estimator 
Pa of Pa[/] is f° r x inD 

J 

Pa(x) = ^a ok <j) 0k (x) + ^^(3 jk tr~ ^^ jk (x). (14) 

k j=0 k 3 k — ) 

Definition 1. We define /t,a o- n estimator of f for x in T> as 

fT,A(x) = PA(x). (15) 

2.3 Convergence rates 

We estimate densities / which verify a smoothness property in term of Besov 
balls 

T(s,tt,WI) = {fe T(R), Wfh^v) < art}, 

where 97t is a positive constant. We are interested in estimating / on the 
compact interval V, that is why we only impose that its restriction to V belongs 
to a Besov ball. 

Theorem 1. We work under Assumptions 1 and 2, let At be such that TA T = 
O(l) up to logarithmic factors in T. Let tt > 0, a > s > 1/tt, p > 1 A tt and 
Pa t be the wavelet threshold estimator of P a t [/] on V constructed from (</>, ijj) 
and defined in (14). Take J such that 

2 J T- 1 log (T 1 / 2 ) < 1, 

and 

n = /cT-Vyiog (TV2), 

for some k > 0. Let 

i \ ■ ( s s + l/p-l/ir \ ( , 

a(s, p, tt) = mm < , — p , r- (Id) 

V ^ ; L2a + l'2(s + l/2-l/7r)J 

i j JTte estimator Pa t verifies for large enough T and sufficiently large n > 

sup (e[||pT t -PA T [f]\\ P L p{ v)]) 1/P < er^'^\ 
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up to logarithmic factors in T and where <t depends on s, 7T,p, StJt, (j>, if), fj,. 
2)The estimator /t,A t defined in (15) verifies for T large enough, sufficiently 
large k > and any positive constants a < a 

sup sup (E[||/ r , AT - f\\i p(v) ]) 1/P < CT-«(^\ 

up to logarithmic factors in T, where fi = j tr(t)dt and where <£ depends on 
s, 7r, p, Wl, 4>,ip,a and a. 



The proof of Theorem 1 is postponed to Section 6.1. Theorem 1 guarantees 
that when A = At tends rapidly to 0, namely TA T = 0(1), the approximation 
/ ~ Pa t [/] enables to achieve minimax rates of convergence (see Section 5). 
The estimator does not depend on r. 



3 Estimation of / in the slow microscopic regime 

In this Section we consider the regimes for which there exists < 5 < 1 with 
TA T = 0(T S ), up to logarithmic factors in T. 

3.1 Construction of the estimator 

We construct the estimator corrected at order K, following the estimation pro- 
cedure described in Section 1.2. 



Construction of the inverse 

Define the space 

H(s, tt,D, m) = {h, \\h\\ Ll(p) < O, IHIs^pD) < at}, 

where D is any constant strictly greater than 1 and 91 is a positive constant 
strictly greater than Wl. The space %(s,7T, £),9T) is a subset of B^^iV) which 
is a Banach space if equipped with the Besov norm (8). 

First we approach the inverse of Pa with a fixed point method. Consider 
the mapping Ha,/ defined for h in 'H(s,7r,£),9T) by 

H AJ [h]:=P A [f} + h-P A [h}. (17) 
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We immediately verify that / is a fixed point: Haj[/] = /. The constraints 
1 < O and 9JT < 91 ensure that if / is in .F(s, 7r, 9Jt), then Ha,/[/i] sends ele- 
ments of H(s, ir, D, 91) into itself (see Proposition 2). The following Proposition 
guarantee that the definition of the operator (17) matches the assumptions of 
the Banach fixed point theorem. 

Proposition 2. The following properties hold. 

1) Let 7r > 1, the space (7i(s,n, D,%1), \\-\\b s (T>)) ^ s a closed set of a Banach 
space and is then complete. 

2) The mapping Ha,/ sends elements o/ 7r, D, 91) into itself and is a con- 
traction. For all hi,h,2 G H(s,ir,Q) we have that 

||H A ,/M - HA./Nllg^p) < £(A)||/ii - h2\\ Btga( p), 

where 

£(A) = 2D(e 2r (°) A - 1) + 2r(0)A. (18) 
Moreover since At — > we have 

&(A T ) < £A T < 1 (19) 
for some positive constant £ depending on r(0) and D. 

Proposition 2 enables to apply the Banach fixed point theorem; we derive that / 
is the unique fixed point of Ha,/ and from any initial point ho in 7i(s, tt, D, 91) 
we have 

||/ - H A/ I^o] 1 1 6^(2,) -> as K^-oo, 

where o stands for the composition product and H^y is Haj ° • • • ° Ha,/, K 
times. We choose ho = Pa [/] as a starting point (Lemma 2 in Section 6 ensures 
that P A [/] belongs to H(s, tt, O, 01)). 

Proposition 3. Let it > 1 and define the operator La,x as the Kth degree 
Taylor polynomial o/H^j[Pa[/]] in A. It verifies for p > 1 

HS [Pa[/]] - L A ,x t m < CA A ' +1 (20) 

Lp(D) 

where £ is a positive constant depending on r(0), 9JT and D. Moreover we have 

K+l 

LA,K = ^m(A)P A [/]* m , (21) 

m=l 

where for m = 1, . . . , K + 1 we have |/ m (A)| < <tA m ~ l where £ is a positive 
constant that depends on r(0) and K . 
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Construction of estimators of the PaI/]*" 1 

Consider the increments (D A Aj = Xi& — Xu_i\&,i = 1, . . . , I^TA -1 ]) intro- 
duced earlier and define the nonzero ones using 

5i = inf{j,D A X j ^0} A LTA- 1 ] 

Si = inf { j > S^, T> A Xj # 0} A |rA _1 J for i > 1, 

where Si is the random index of the ith. jump. Let 

ITA- 1 ] 

i=l 

the random number of nonzero increments observed over [0, T\. By Assumption 
1, on the event {D A J*Q = 0}, no jump occurred between (i— 1)A and iA. In the 
microscopic regime when A = At — > as T goes to infinity many increments 
are null and convey no information about /, hence for the estimation of / we 
focus on the nonzero ones 

(d a x Si ,...,d a x 5jVt ). 

They are identically distributed of density given by (12); Lemma 1 still applies. 

We construct wavelet threshold density estimators of the K + 1 first convo- 
lution powers of Pa [/] ; define the wavelet coefficients for m > 1 

N T,m 

' i=l 

where iVjr,m = [-A^r / mj > 1 for large enough T and 

D A X Si = D A X 5i + D A A^ +! + • • • + D A X S(m _ 1)JVTm+s . 

Let r\ > and J G N \ {0}, define PA, m the estimator of PA[/]* m over V for 
m > 1 

J 

J\^(aO = ^a^Vofc(x) +EE$ )l (,»|>,}^( a: ). x ZzT>. (23) 

As mentioned earlier r is a nuisance that needs to be estimated. To simplify 
the problem, we make the following parametric assumption on r. 
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Assumption 3. Assume there exists i? in Q a compact subset ofM. such that 

t(x) = t\(x, i9), VxG[0,oo), 

where t\ is known, ti(0,i?) > and i? — > ti(.,i?) is C l . Assume there exists q 
from to [0, 1], invertible, such that g(#) = P(i?A 7^ 0) and whose inverse q~ l 
is bounded. 

Assumption 3 enables to estimate the unknown coefficients fp m (A)j and 
(Z m (A)), and to compute the estimator of / defined hereafter. 

Definition 2. Let & be the estimator corrected at order K defined for K in 
N and x inD as 

K+i 

(24) 

m=l 

where 

and i/ie Z m (A,i?) are defined in Proposition 3. 

When A T satisfies TAf, = 0(1), /£ A defined in (24) with K = and / t ,a 
defined in (15) coincides. 

3.2 Convergence rates 

Assumption 4. Assume that there exist (21, a, g) positive constants such that 
t(x) < 2lexp ( - ax 3 ), VxG[0,oo). 

Assumption 4 is a technical condition which ensures that r has moments of 
all order. It is used in the proofs to replace Nt/T by its asymptotic determin- 
istic limit. Compactly supported densities and densities with subexponential 
queues satisfies Assumption 4. 

Theorem 2. We work under Assumptions 1, 2, 3 and 4 and assume that there 
exists < 6 < 1 such that 

TAf. = 0{T S ), 
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up to logarithmic factors in T. Let 7r>l,o">s>l/-7r,p>l and P/± T m oe 
the threshold wavelet estimator o/Pa t [/]™ on D constructed from (4>,ip) and 
defined in (23). Take J such that 

2 J T~ 1 log (T 1/2 ) < 1, 

and 

r, = /cr-vyiog (rV2), 

for some k > 0. 

For rn > 1 i/ie estimator P/\ T ,m °/PA r [/]* m verifies for sufficiently large 

K > 

sup (E[||^i - PA T [/]* m ||^ < er-«(-*.*>, 

up to logarithmic factors in T, where a(s,p,n) is defined in (16) and where <£. 
depends on s, tt, p, SOT, 0, V" a^d 

£^ The estimator corrected at order K f^^ for K e N defined in (24) verifies 
for T large enough, sufficiently large k > and any compact set 6cl 

sup sup (E[||/£ A -/||£ ]) 1/P <(j:max(r- Q ^),A^ +1 ), 
tfee/e.F(s,7r,an) ^ ; 

up to logarithmic factors in T and where £ depends on s, ir,p, (j), ij) and K. 

The proof of Theorem 2 is postponed to Section 6.3. Since a(s,p, tt) < 1/2, 
Theorem 2 ensures that whenever At and T are polynomially related it is 
always possible to find Kq such that the estimator corrected at order Kq achieves 
the minimax rate of convergence (see Section 5). If At decays slower than any 
power of 1/T, for instance if it decreases logarithmically with T, the estimator 
corrected at order K still provide a consistent estimator of /. 



4 A numerical example 

In this Section we illustrate the results of Theorems 1 and 2. In both cases 
we compare the performances of our estimator with an oracle: the wavelet 
estimator we would compute in the idealised framework where all the jumps 
are observed 

fOrade {x) = £ ^^0^) + EE ^^Or^lM^ 
k j=0 k 



16 



where 



s o r de = ^X> fc te) and p° k rade = ~ Effete), 

i=l i=l 

i?r being the value of the renewal process R at time T and the jumps. The 
parameters J and 77 as well as the wavelet bases ((f), if)) are the same for all the 
estimators. 

We consider a renewal process with a -Beia(l,i?) interarrival density r. We 
have 1? = 3, the first shape parameter is set to 1 to ensure the condition < 
Ti(0, 1?) < 00. We estimate the compound law given by 

f(x) = (l-a)h(x) + af 2 (x), 

where f\ is the uniform distribution over [—2, 2] and fi is a Laplace with location 
parameter 1 and scale parameter 0.5, we take a = 0.5. We estimate the mixture 
/ on D = [—10, 10] with the estimator corrected at order K for different values 
of K and study the results with the L2 error. We also compare them with 
the oracle f° racle . Wavelet estimators are based on the evaluation of the first 
wavelet coefficients, to perform those we use Symlets 4 wavelet functions and 
a resolution level J = 10. Moreover we transform the data in an equispaced 
signal on a grid of length 2 L with L = 8, it is the binning procedure (see Hardle 
et al. [14] Chap. 12). The threshold is chosen as in Theorems 1 and 2. The 
estimators we obtain take the form of a vector giving the estimated values of 
the density / on the uniform grid [—10, 10] with mesh 0.01. We use the wavelet 
toolbox of Matlab. 

4.1 Illustration in the fast microscopic case 

In this case we choose A = T" 1 / 2 . Figure 1 represents the estimator /t,a of 
Definition 1 and the oracle. The estimators are evaluated on the same trajec- 
tory. They are quite hard to distinguish, what is confirmed by the comparison 
of their L2 losses. 
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Figure 1: Estimators of the density / (plain dark) for T = 10000 and A = 0.01: 
the oracle (dotted red) and the estimator fx, A (dashed green). 



We approximate the L2 errors by Monte Carlo. For that we compute M = 
1000 times each estimator (for T = 10000 and A = 0.01) and approximate the 
L2 loss by 

. M 2000 

M E ( E W- 10 + °' 01 ^ - (~ 10 + °- 01 ^)) 2 x °- 01 ) • 

For each Monte Carlo iteration the estimators are evaluated on the same tra- 
jectory. The results are reproduced in the following table. 



Estimator 


Oracle 


/t,a 


L2 error (xlCT 4 ) 


0.1916 


0.2040 


Standard deviation (x!0~ 5 ) 


0.4519 


0.4605 



4.2 Illustration in the slow microscopic case 

We now study the behaviour of the estimator corrected at order K for different 
values of K. We choose T = 10000 and A = 0.1. In that case TA 2 is large 
but TA 4 is 1. According to Theorem 2 we should observe that the estimator 
corrected at order 2, behaves as the oracle. Figure 2 represents the estimators 
fr A defined in Definition 2 for K E {0, 1, 2, 3} and the oracle. The estimators 
are evaluated on the same trajectory. They all manage to reproduce the shape 
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of the density /, and graphically apart from the estimator corrected at order 
they are difficult to distinguish. 



0.7 r 




.0.1 I 1 1 1 1 1 1 1 1 

-3-2-10 1 2 3 4 5 



Figure 2: Estimators of the density / (plain dark) for T = 10000 and A = 0.1: 
the oracle (dotted red) and the estimator f° r K = 0,1,2, 3 (dashed light 
to dark grey). 



We compare their L2 losses in the following tabular. 



Estimator 


Oracle 


K = 


K = 1 


K = 2 


K = 3 


L2 error (xl0~ 4 ) 


0.1896 


0.5176 


0.3037 


0.2959 


0.2946 


Standard deviation (xl0~ 5 ) 


0.4348 


0.7800 


0.7533 


0.7462 


0.7466 



This confirms that there is an actual gain in considering the estimator corrected 
at order 1 instead of the uncorrected one. In the following table we estimate 
the ( p m (A)) defined in Proposition 1. 



Estimated quantity 


Pi 


P2 


P3 


Estimation 


0.8527 


0.1327 


0.0135 


Standard deviation (xlCT 3 ) 


0.9185 


0.7388 


0.1597 



It turns out that making no correction is equivalent to estimate a density on 
a data set where 15% of the observations are realisations of a law which is 
not target. This explains why it is relevant to take them into account when 
estimating /. Considering more than 1 or 2 corrections is unnecessary as the 
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L/2 losses get stable afterwards. The L2 loss of the oracle is strictly lower than 
the loss of the estimator corrected at order K, even for large K. That difference 
is explained by the fact that to estimate the mth convolution power we do not 
use Nt data points but Nxm = \Nt I m \ ■ Therefore we do not loose in terms 
of rate of convergence, but we surely deteriorate the constants in comparison 
with the oracle. 

5 Discussion and Conclusion 

Attainable rates. Without loss of generality, assuming T is an integer if we 
observe T independent realisations of the density /, it is possible to achieve 
the minimax rates of convergence T~ a ( s, ' K,p > (see for instance Donoho et ai. 
[10]). When the process X is continuously observed over [0, T], we have Rt 
independent and identically distributed realisations of /. Moreover for T large 
enough, the elementary renewal theorem guarantees that Rt is of the order 
of T (see for instance Lindvall [19]). It follows that the estimators of / given 
in Sections 2 and 3 enables to attain the minimax rates of convergence of an 
experiment where X is continuously observed. 

Comparison with a previous work. The results of this paper are the 
generalisation to the renewal reward case of Duval [11]; a compound Poisson 
process is a particular renewal reward process and Theorems 1 and 2 enable to 
recover the results of [11]. However in this paper we do not have an explicit 
formula for the estimator corrected at order K but only a construction method. 
In the Poisson case it is much more simpler to apply the results of [11]. 

Extension to the case where A is fixed. We established Theorem 2 for 
A^ vanishing to 0. Since the approximation of the inverse depends only the 
fact that Haj is a contraction, the method remains valid for A's such that 
■ft(A) defined in (18) is strictly lower than 1. Which means that we can expand 
the results to cases where A does not go to but satisfies .ft(A) < 1. The value 
of the maximum value Ai satisfying the former inequality depends on r(0) and 
D but is not only determined by (18). Another hidden condition on A have 
to be satisfied for Ha,/ to send elements of ~H(s, tt, £), 0T) into itself. Then to 
find Ai one has to solve an optimisation program with constraints to find Ai 
and D giving the maximum coverage for A. To get an idea of the value of Ai 
we use the function Maximize of Mathematica and find that one should take 
D = 1.645 and Ai = 0.071/r(0) > 0, which is positive. The results of Theorem 
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2 should generalise in for all At — > A^ such that A^ < Ai and for K G N the 
rate of convergence for the estimator corrected at order K is bounded by 

max{T- Q ( s ' p ^,A^ +1 }. 

However to achieve suitable rates theoretically one should consider larger K, 
therefore the dependency in K in the constants need to be handled carefully. 
In practice for T = 10000 and A = 0.1 considering K = 2 appears sufficient to 
have T _Q ( s ' p,7r ) predominant in front of A^ +1 . 

Discussion on Assumptions 1 and 3. In the present paper we made two 
simplifying assumptions on the interarrival density r. First we assume that J\ 
was distributed according to tq to work with a process with stationary incre- 
ments. In fact if r has finite expectation this assumption is not necessary since 
asymptotically the process has stationary increments (see Lindvall [19]). The 
second assumption is that r is described by a 1-dimensional parameter Gen- 
eralising the result to a d-dimensional parameter should be possible at small 
cost, but removing all parametric assumption on r would demand to solve a 
nonstandard nonparametric program for r from the observations (2): obser- 
vations (2) only give access to truncated values of realisations of r spaced of 
more than A. Then the problem of estimating r from (2) should be considered 
separately. 

Other generalisations. We constructed in the microscopic regime an adap- 
tive minimax estimator of the jump density of a renewal reward process. The 
methodology presented here should adapt to any process defined similarly to 
X but whose counting process has stationary increments and manageable de- 
pendencies. We consider in the present paper a renewal counting measure since 
we are interested in expanding the methodology to other regimes of A, namely 
when A = Ay tends to a constant (intermediate regime) or to infinity (macro- 
scopic regime) . The macroscopic regime is of special interest since the observed 
process presents diffusive or anomalous asymptotic behaviour determined by 
the laws / and r (see for instance Meerschaert and Scheffler [21, 22] or Kotulski 
[18]) and many applications have a model based on a macroscopically observed 
renewal reward processes. For instance in physics where they are used to model 
particle motion (see Watkins and Credgington [30] or Cuppen et al. [8]), in bi- 
ology to model the proliferation of tumor cells (see Fedotov and Iomin [12]) or 
lipid granule motion (see Jeon et al. [16]), they are also used to model records 
(see Sabhapandit [25]). 
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6 Proofs 



In the sequel £ denotes a constant which may vary from line to line. 



6.1 Proof of Theorem 1 
Proof of part 1) of Theorem 1 

To prove part 1) of Theorem 1 we apply the general results of Kerkyacharian 
and Picard [17]. For that we establish some technical lemmas. 

Lemma 2. Iff belongs to J r (s,ir,'lUl) then for m > 1, Pa [/J*™ also belongs to 

To prove Theorem 1 , we use Lemma 2 for m = 1 only, but we take some advance 
on the proof of Theorem 2. 



Proof of Lemma 2. It is straightforward to derive UPaI/]*" 1 )!^^ = 1- The 
remainder of the proof is a consequence of the following result: Let / 6 J B* OD (2?) 
and g £ L\ we have 



\\f * gWsi^v) < ll/lk^llslki 



(25) 



To prove (25) we use the definition of the Besov norm (8); the result is a 
consequence of Young's inequality and elementary properties of the convolution 
product. First Young's inequality gives 



Wfi * h\\L^( 



< 



'iIlM 



T2\\Li( 



(26) 



Then the differentiation property of the convolution product leads for n > 1 to 



d n 
dx 11 



(/l*/ 2 ) 



< 



dx n 



\\f>. 



2||Li 



(27) 



Finally translation invariance of the convolution product enables to get 

\\n h n h [(h*h)W]\\ LAV) = \\(B h n h [fi%*f 2 \\ LAV) 



(28) 



Inequality (25) is then obtained by bounding ||/*<?||bs (p) using (26), (27) and 
(28). To complete the proof of Lemma 2, we apply m— 1 times (25) which leads 
to 

Vm 6N \ {0}, UPAl/rll^^ < ||Pa[/]|| b?oo(c) - 
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The triangle inequality gives ||Pa [/]* m ^^(X)) < Wfh^v) < W which con- 
cludes the proof. □ 

Lemma 3. Let 2 J < T, then for p > 1 we have 

^k-i Jk \ p ]<£ pMLp{R)m T- p/ \ 

where jjk is defined in (13) and 

l 3 k = J g jk (y)PA[f](y)dy. (29) 

Proof of Lemma 3. The proof is obtained with Rosenthal's inequality: let p > 1 
and let (Yy, . . . , Y n ) be independent random variables such that E[i$] = and 
E[|li| p ] < 00. Then there exists £ p such that 



E 



8=1 J ^ 1=1 8=1 ^ 



According to Proposition 1 the (D At JQ) have distribution 

/d^wi, =p(A t )5 (x) + (1 -p(A T ))P AT [f](x), xeV 
where So is the Dirac delta function and p(Ay) = P(i?A T = 0). We derive 

E [^jk] = J 9jk(z)t {z7 L } ^ D _ ^T^T dz = J gjk(z)PA T [f](z)dz = j jk . 

Then jjk— Jjk is a sum of centered and identically distributed random variables, 
define 

Zi = 1 - p (A T ) 9jk ^ AT X ^ 1 ^ r>ATX ^°y 

Since x is a renewal reward process, nonzero and nonconsecutive Zi are indepen- 
dent, then if we separate the sum in two sums of nonzero and nonconsecutive 
indices we can apply Rosenthal's inequality for independent variables to each 
sum, it wont affect the rates but the constant will modified. For p > 1 we have 
by convex inequality 



E[|Z i -E[Z i ]| p ] <2*>E[|^| P ] 

2P23P/ 2 



< 



(I-p(At)) 



p-i 



g(2 3 y - A;)| p 1{^ }/d a t (y)dy 
J \g( z )\Pp AT [f]^^.y z , 



[l-p(A T )) p 
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where we made the substitution z = 2 3 y — k. Lemma 2 and Sobolev embeddings 
(see [4, 10, 14]) 

^oo^^p'oc and H^o^BSooo, (31) 

where p > tt, sir > 1 and s' = s — 1/ir + 1/p, give UPa^/IH^ < 9Jt. It follows 
that 

E[\Zi -E^lH < 2^^f/ 2 - 1 )|| 5 ||^ (R) 9Jr/(l-p(A T )r 1 

and 

E^-E^ 2 ] <9Jl/(l-p(A r )) 
since ||5 , ||l 2 (ir) = 1- Rosenthal's inequality (30) gives for p > 1 

E[i%*-7i*n <^{ 2P (£) f ~ 1 ii3ir MK )^+^ p/2 }A- f , 

where At = [TA^ 1 ] (1 — p(Aj-)). To conclude we use that 

1 - p(A T ) = P( Ji > A r ) = - (1 - 

Jo 

since Ji has distribution (1), and derive that there exists Ai > such that 
F(A\) < 2 and for all At < Ai we have 

|^<l-p(A T )<^. (32) 

It follows that 

T „ T 

— < A T < - 

2fl jjL 

and then using 2 3 < T 

The proof the complete. □ 
Lemma 4. Choose j and c such that 



2 j T- 1 \og(T 1 / 2 ) < 1 and c 2 > 32//(9Jt + 



g||g||oo 

6 

For all r > 1, let K r = cr. We have 



7 J ,-7, fc |>yT- 1 /yi og( Ti/2))<T-'-/ 2 , 



where is defined in (13) and jjk in (29). 
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Proof of Lemma 4- The proof is obtained with Bernstein's inequality. Consider 
Yi, . . . ,Y n independent random variables such that \Yi\ < 21, E[Y^] = and 

b l = ELi E K 2 ]- Then for an y A > °i 

A 2 



'(|^y i |>A)<2e X p(-^ T ^). (33) 



i=l 



We keep notation Z{ introduced in the proof of Lemma 3, 7jfc — is a sum of 
centered and identically distributed random variables bounded by 2- ? / 2 ||g|| 00 / (1— 
p(At)) which verify 

E[|^-E[^]| 2 ] < £01/(1 -p(A T )). 

After separating the sum to get two sums of nonzero and nonconsecutive indices 
we apply Bernstein's inequality (33) for independent variables to each sum, 
which modify the constants. It follows that 



'(fe-^l^yT-Vyiog^) 

^r- 1 iog(r 1 /2 )L TA T 1 j(i-p(A T ))' 



<2 exp 



16(^+ K - r ' 1/2 ^f 7iy2J/2||g|1 - 



Using that 2iT~ l log(T 1 / 2 ) < 1 and (32) which gives 

1 



T- 1 LTA f 1 j(l-p(A T ))> 2/t . 



we have 



7i fc -7i fe |>yT-Vyi g(rl 



,-.2 ? 

32//(9JT + 

since c 2 > 32/i(9H + c ^ 9 ^°° ) . The proof is complete. □ 



<2exp ( ^ rlo g (T^)) < T^, 



Proof of of part 1) of Theorem 1. It is a consequence of Lemma 2, 3, 4 and of 
the general theory of wavelet threshold estimators of Kerkyacharian and Picard 
[17]. It suffices to have conditions (5.1) and (5.2) of Theorem 5.1 of [17], which 
are satisfied -Lemma 3 and 4- with c(T) = T -1 / 2 and A n = c(T) -1 (with 
the notation of [17]). We can now apply Theorem 5.1, its Corollary 5.1 and 
Theorem 6.1 of [17] to obtain the result. □ 
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Completion of the proof of Theorem 1 

To prove part 2) of Theorem 1 we decompose the L p loss as follows 

mfTA T -f\\i v{v) ]) llp 

< (E[||/ T)At -PA T [f]\\ P LA v)}) 1/P + ll P A T [/] " f\\ Lp{V y 
An upper bound for the first term is given by part 1) of Theorem 1 

(E[||/ r , AT -PA T [f]\\l pm }) 1/P < £T-^\ (34) 
where £ continuously depends on /z, and on s, ir,p, 9JT, <fi, ip and /i. Since 

oo 

Pa t [/] -/ = -(!- Pi (A T ))/ + ^ Pm (A T )P AT [f]* m 

m=2 

Lemma 1, Young's inequality, which gives ||PA T L/]* m ||L p (£>) < ||/||l p (z>) and 
Sobolev embeddings (31), which give ||/||z, (d) < SCTl, enable to get the bound 

oo (2 T (0)A ) m ~ 1 

ll P A T [/] - f\\ Lp{v) < 2r(0)A T + 29JT £ 1 ^ < £A T , (35) 

m=2 ' ' 

where £ continuously depends on t(0) and We finish the proof noticing that 
(34) is predominant in front of (35) since a(s,p,n) < 1/2 and TAj. = 0(1). 
Finally we take the supremum in fx and r(0) over any compact of (0, oo) to 
render the constant independent of the unknown interarrival law r. The proof 
is now complete. 



6.2 Proof of Proposition 2 

First we prove part 1) of Proposition 2. The set H(s,ir,0,yi) is a subset of 

which is a Banach space. We show that %(s, 7r, D, 9T) is complete since it is a 
closed subset of a Banach space. For that we establish the following assertions; 
for all sequence h n E %(s, 7r, D, 9T) such that there exists h with 

\\hn ~ h\\^s mj) ~~ 0, as n — > oo, 
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we have h G 7i(s,n,D,yi) i.e. \\h\\Q S (p\ < 9T and H/iHl^x?) < O. The first 
inequality is immediate. The second one is a consequence of the compactness 
of V. Indeed 

\\h>n ~ h\\j3s (£>) — > 0, as n — > oo, 
we have by definition of the Besov norm (8) that 

\\h n - h\\ Lv (pj -»• 0, as n -> oo. 

Since 2? is compact and 7r > 1 we derive from Holder's inequality that 

\\hn ~ hW^CD) -^0, as n — > oo 

and then ||/i||z,i(X>) — ^ follows. The proof of part 1) of Proposition 2 is now 
complete. 



To prove part 2) of Proposition 2 we show that Ha,/ sends elements of 
%{s, it, Qjtyl) into H(s, 7r, £>, OT) and that it is a contraction. We start with the 
first assertion, the triangular inequality gives for h G H(s,-K,Q,y\) 

oo 

||H Ai/ [/ l ]|| ii(2 , ) <||P A [/]|| ii(2 , ) + (l-p 1 (A))||/ i || 1 +^p m (A)||/ l * m || Ll(c)! 

m=2 

where ||Pa[/] ||t 1 (xi) — ||Pa[/] H^hr) = ^- Immediate induction on Young's 
inequality leads to 



ll^k (c) <INI^)<o m 

since /i G 7r, D, 9?) and with Lemma 1 we get 

HhajWIL ^ 1 + 2£ M°) A + ^ (e2 ° r(0)A " 1 " 2Dr (°) A ) ^ D 

for A small enough since O > 1. Similar computations and (25) give 

||h a , / [/ 1 ]|| b , ooP) <||Pa[/]|| b , oo(i , ) + (i-pi(a))||/ 1 ||^ oo(c) 



+ X> m (A)||/ l Hk 



m=2 



<9Jt + 2r(0)A9T + -£-(e fflT(0)A - 1 - 2Dr(0)A) < % 
r(0) AD 

for A small enough since 9Jt < 91. Then if /i is in 7r, D, 91), Ha,/ belongs 
to H(s,ir,D,m). 
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For the contraction property, we have for all hi,h 2 £ ~H(s, n,£),yi) 

H A ,/M - H A ,/N =(1 - Pl (A))(fci - /i 2 ) 

oo m— 1 

- (fci Pm(A) £ ^ 9 * /^r" 1 " 9 (36) 

m =2 <j=0 

Lemma 1 gives 

0< l-pi(A) < 2r(0)A, (37) 
and with Young's inequality and since hi and h 2 belong to %{s } n, O, 91) we get 

oo m—1 

\\Y,Pm( A )J2 h i Q * h T^ q < 2^ (e 2r (°)° A - 1) (38) 

"m=2 ? =0 LliP) 

for A small enough. Finally injecting (37) and (38) into (36) leads to the 
contraction property for all h\,h 2 G H(s, w, 0, 9t) 

||H A ,/N - H A ,/[/» 2 ]|| B . oo(1 , ) < (2r(0)A + 2D(e^°)° A - l)) \\hi - h 2 \\ B s oo(p) , 
which concludes the proof. 

6.3 Proof of Theorem 2 
Preliminary 

The estimators of the convolution powers of Pa[/] depend on Nt which is 
random and depends on the D^JQ. 

Lemma 5. Work under Assumption 4 and let p(A) = P(i?A 7^ 0) and Ai be 
such that J^ 1 r{x)dx < \. Then for all A > and A < Ai we have 



> AA) < exp ( - £VTa) , 



[TA 

where €, depends on 21, a, q, fj,, A. 
Proof of Lemma 5. We have 

ITA- 1 ] 



N T 



LTA-iJ " v ; [TA" , , 
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where 

Y = 1{dax^} - P(A), i = 1, • • ■ , L^A- 1 ] 
are centered random variables, bounded by M = 1 — p(A) and such that 

EK 2 ]<p(A). 

To show the result, we apply Theorem 4.5 of Dedecker et. al. [9] which is 
a Bernstein-type inequality for dependent data. We have to verify conditions 
(4.4.16) and (4.4.17) of Theorem 4.5 of [9]. With their notation, condition 
(4.4.16) ensures that for all u-tuples (si,...,s u ) and all u-tuples (ti,...,t v ) 
such that 

1 < si < ... < s u <h < ... <t v < [TA~ 1 \ 

we have 

\Cov{Y Sl . . . Y Su ,Y tl ...Y tv )\< K 2 M U + V ~ 2 uvp(t x - s u ), 

for some positive constant K and a nonincreasing function p satisfying (4.4.17) 
namely 

oo 

where L\, L2 and v are positive constants. 

Since X is a renewal process, Y Sl . . . Y Su and . . . Yf v are independent if 
there exists r such that s u < r < t\ and Y r = 1 — p(A) i.e. there is a jump 
between Y Su and Yt x . For the covariance to be nonzero it is necessary that no 
jump occurred between s u A and (ti — 1)A. Let s = t\ — s u — 1 using that R is 
stationary we get an upper bound for p 

/•oo 

p(h - s u ) < F(R (tl _ 1)A - R SuA = 0) = F(R sA = 0) = / r (x)dx (39) 

JsA 

which decreases with s. Moreover since the Yi are centered and bounded by 
M < 1 we have by Cauchy-Schwarz and IE[^ 2 ] < p(A) 

I Cov(Y sl ...Y Su ,Y tl ...Y tv )\ < I Cov (Y n , Y t J | < ^/e\Y^\E\Y^\ <p(A). 

We deduce that condition (4.4.16) is fulfilled with K = p(A) 1 ^ 2 and the nonin- 
creasing sequence p. 

Next we show that p satisfies (4.4.17), using Assumption 4 and (39) we get 
for s > 1 

p(s)<- (1- r(t)dt)dx < £exp ( - o(sA) '), 

M JsA T Jo 
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where g < g' and <£ depends on 21, o, fi, g. Which leads to for k > 



oo 1/A oo 

Y,s k P(s)<Y. sk + <t E s fe exp(-a( S A/) 

s=l/A 



s=l 



s=l 



< A -( fc+1 ) + CA- fe £ s /fe exp ( - a( S y') 

s'=l 

< £ A -(fc+l) 



(40) 



where £ depends on 21, a, g, /i, condition (4.4.17) follows with L\ = £A , L2 
A -1 and v = 0. 

We can now apply Theorem 4.5 which gives for all A > 
N T 



[TA- 1 ] 
< 2 exp 



p(A) 



> AA 



LTA-!J 2 A 2 A 2 



2(LTA-!Jp(A) + (LTA- 1 JAA) 3 / 2 v / 25A 3 2) 
Using (32) we derive for A < Ai 



N T 



p(A) > AA) < exp ( - cVta) , 



[TA-\ 

where £ depends on A, 21, a, g, \i. The proof is now complete. 



□ 



Proof of part 1) of Theorem 2 

As for the proof of part 1) of Theorem 1 we apply the general results of Kerky- 
acharian and Picard [17] and first establish some technical lemmas. 



Lemma 6. Let 2 J < T , then for p > 1 we have for all m > 1 

rp-p/2 



where 7^ is defined in (22) and 



(m) 



g j k(y)PA[fV m (y)dy. 



(41) 
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Proof of Lemma 6. For m > 1, 7^ — T^fc * s ^ ne sum °^ \_Nt/tti\ identically 
distributed random variables, where Nt is random. First we replace Nt by its 
deterministic asymptotic limit using the following decomposition 



(to) ip-l 



=E[|7. 



(to) 



(m) 



(to) ip 



-p(A T ) >A 



}1 



. Tu<\\^-(m) (m) ip., I 



Take A = 1/4// and denote n m = |_T/m/xJ and n' m = [T/(4m/i)J , we have with 
that (32) that 



\9 ooil 



•p(A 



> 



t/i) 



+ E 



-L^(D^ 5i -E[D^]) 



1=1 



For the first term of the right hand part, TAt = 0(T S ), Lemma 5 and 2 J < T 
leads to 



^J__ p(At )| > ^) <^/ 2 exp(-CT 5 ) < (texp(-T*p), (42) 

for some 8' < 5 and where £ depends on p, ||p||oo,2l, dj/*- For the second term 
we apply Rosenthal's inequality (30). Since X is a renewal process the variables 
(D m Xs 2i ) . are independent but dependent of the variables (D^X,s 2i+1 ) which 
are independent. It ensures that the variables (D m X,g.) are distributed accord- 



ant) 



(to) 



between odd 



ing to PA T [/]* m - Moreover if we separate the sum 7k ' — 7j fc 
and even indices we can apply Rosenthal's inequality for independent variables 
to each sum. For p > 1 we have by convex inequality 



E[\g jk (T>iX Sz 



ljk\ 



<2*>E[\g jk (T> m X Si )\ P ] 

< 2 n^ 2 J \g(2?y - k)\PP AT [fr n (y)dy 



z + k 



21 J 



)dz, 



where we made the substitution z = 2 3 y — k. Lemma 2 and Sobolev embeddings 



(31) give P At [JT 



< SDT. It follows that 



E[\g jk (D£x Si 



1jk\ 



< 2 p 2 j ^ 2 -V\\g\\ p r 
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and 



since 



E[\g jk (B^X Si )- Jjk \ 2 ] <m 
1. We derive for p > 1 



E 



' I'm . 



i=l 



-P/2 



(43) 



It follows from (42) and (43) that 

E[|7]r } - 7S°n <Cexp(-T«) + cr-l < CT-i, 

since the first term is negligible in front of the second as 8' > where £ depends 
onp,m,\\g\\ Lp (T S ),\\g\\ 00 ,flL,a,Wt,fi. It concludes the proof. □ 

Lemma 7. Choose j and c such that 

2 j T~ 1 log(T 1/2 ) < 1 and c 2 > 256mn(wi + 
For all r > 1 let n r = cr. We have for all m > 1 
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P |7 



(m) (m) 



Ijk ^jk i — 2 

where 7^ is defined in (22) and 7^ in (41). 



-r/2 



Proof of Lemma 7. As for the proof of Lemma 6 we decompose as follow for 
m > 1 



< 



N T 



LTA" 1 



p(a t ; 



> 



4/x ) 



+ p (l^E (°-^ -e[d£x Si ])| > fr-vyiog^)), 

m i=l 

where re m = |_T/m^J and = |_T/(4m/x)J . From TAj, = 0(T S ) and Lemma 
5 we derive 



[TA~\ 



p(A T ) 



> 



A 7 



< exp 



(44) 
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where £ depends on 21, a, fj,. For the second term we apply Bernstein's inequality 
(33) and as in the proof of Lemma 6 we separate the sum between odd and even 
indices to work with independent variables. We get 



i=l 

<2 exp [ - - 



itn'jT-HogiT 1 /*) 



16 n ra aJl + 



Kr n> m T-l/2y/\og(Ti/2)2J/2\\g\\ 



<2 exp 



2 

c r 



■logCT 1 ^) 



With 2JT- 1 ^(T 1 / 2 ) < 1 and c 2 > 256m/x(9tt + ^h°) we have for r > 1 

H f-E( D ^ -e[d^ S( ]) > ^T-vyiog^/ 2 )) < <rr. ( 45 ) 

It follows from (44) and (45) that 

(Tj-A _|_ j>-r/2 < rp-r/2 



in 



(m) (m) I 
jk ~ ^jk I — 2 



> ^T-V 2 ^ log(r i/ 2) ) < 



exp 



since the first term is negligible in front of the second since 5 > 0. It concludes 
the proof. □ 



Completion of the proof of part 1) of Theorem 2. It is a consequence of Lemma 
2, 6, 7 and of the general theory of wavelet threshold estimators of Kerkyachar- 
ian and Picard [17]. It suffices to have conditions (5.1) and (5.2) of Theo- 
rem 5.1 of [17], which are satisfied -Lemma 6 and 7- with c(T) = T -1 / 2 and 
A n = c(T)- 1 (with the notation of [17]). We can now apply Theorem 5.1, its 
Corollary 5.1 and Theorem 6.1 of [17] to obtain the result. □ 

Completion of the proof of Theorem 2 

To prove Theorem 2 we define for K in N and xinD the quantity 

K+l 

7t,a( x ) = £im(A,0)i\^(a:). 

m=l 
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It is the estimator of / one would compute if r were known. We decompose 
the L p error as follows 

m\f¥,A T - f\\i piv) ]) 1/p <m\f¥,A T - 7?aX p{ v ) ]) 1/p 

and control each term separately. 
First we look at the second term 

m\i¥ AT mwfrAr -^ T , K \\ p Lp{v) ]) i/p 

+ \\La t ,K ~ H Aj[ P A T [/]|| Lp(:D) 
/L x A T Uli ~ J \\ Lp(p) 



+ ||H25[P AT [/]]-/|L„ ror (46) 



An upper bound for the first term is given by part 1) of Theorem 2, given the 
definition (21) of 1*a t ,k an d Triangular 's inequality we derive 

(E[||7£a t " ^k\\I p{ v)]) 1 ' P < £T~^\ (47) 

where £ depends on s, ir,p, Wl, cfi, tp, and K. By (20), we have 

||L At ,* - Hf f [P AT [f]\\ Lp{v) < £A« + \ (48) 

where £ depends on t?, D and 9Jt. For the last term we use the fixed point 
theorem's approximation, first we have to relate the L p norm with the Sobolev 
one. Triangular's inequality ensures that if / is in B^ (V) then H^fy [Pa t [/]] — 
/ is in B^^iV). It follows using Sobolev embeddings (31) that 

|| h aM Pa t[/]] - f\\ Lp (v) ^ II h aj[ Pa t[/]] -fh^vy 
We now use the approximation given by the Banach fixed point theorem 

||H2&[Pa t [/]] -fh^fp) <^(AT) K ||H A) /[PA r [/]] -PA T [/]|| fl ^ (17) . 

After replacing Haj[Pa t [/]] by its expression and using triangular's inequal- 
ity we have 

||h Ai/ [Pa t [/]] - PA.mll^p) < £A T , 

which leads to 

||H2&[P At [/]] " f\\ Lp{ v) ^ £&t&(At) K , (49) 
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£ depends on <&, ffl, O, K. We conclude by injecting (19), (47), (48) and (49) 
in (46) and taking the supremum in i? over the compact set 0. 



T ii£ (d)]j ^e triangle inequality leads to 



We now control E[||/£ At - f£ A 

m\MA T -i¥ A x pm ]) l/p 

K+l 

< £ (E[||(/ m (A T ,^)-/ m (A T , 1 9))i^||^ p) ]) 1/p 



where PA T ,m does not depend on $ (see (22)). Cauchy-Schwarz inequality leads 
to 



IE [|| (UA T , *r)-im(A T , 0))PAT,m\\ P L p <p)] 



< E 



\l m (A T ,tf T )-l m (A T ,ti)\ 



2p 



E 



\ P A T . 



:2p 



T.m|lL p (I>) 



where using part 1) of Theorem 2, the triangle inequality and that T > 1 we 
have 



E 



-Pa 



|2p 

t^|Il p (D) 



P/\ T ,m PA T [/] Wl p (V) 



< E 



+ ||PA T [/]* m ||? p(:D) 



(50) 



where £ depends on s, n,p, 9JT, <p, ip, $. We conclude the proof with the following 
Lemma, proof of which is given in the Appendix. 

Lemma 8. Work under Assumptions 3 and 4- We have for all r > 2 

E[|/ m (A T , r T ) - l m (A T , tf)H < <t(T l ~ r + T~ r / 2 ) 

where £ depends on r, 21, a, 

It follows from (50) and Lemma 8 applied with r = 2p that 

e[II/£a t - f¥,A T h pm ] 1/p < c(ri-v(*) + r-v^), 



where £ depends on s, ir,p, Wl, (ft, ip, 21, a, We deduce for p > 1 
sup sup (E[||/£ AT (^-g AT ||i p(I , ) ]) 1/p 

where £ depends on s, it, p, 9JI, (p,^,Ql,a,K. It is negligible compared to T _a ( s ' p ' 7r ) 
since a(s,p, 7r) < 1/2. The proof of Theorem 2 is now complete. 
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Appendix 

Proof of Proposition 1 

Let x 6 R, we have by stationarity 

P(D A X 5l < x) = F(X A < x\X A / 0) 

oo 

= P ( X A < x\R A = m,R A yt 0)F(R A 



_ _ _ m) 

m=0 

oo 



= Pm(A)P(X A < x\R A = m) 

m=l 

where F(X A < x\R A = m) = f* m {y)dy for m > 1. It follows 
P(D A X 5l <x) = f P A [f](y)dy. 

J — oo 

Proof of Lemma 1 

We start with the second assertion. For m > 1 we have 

p (A) - ¥{Ra = m) 
First we derive the lower bound 

1 f A A 
1 _ p(i? A = 0) = 1 - P(Ji > A) > - / 1 - F(A)dx > — (51) 

^ A) 2/x 

since F is a cumulative distribution function; it is positive, increasing and 
continuous with -F(O) = 0. Then there exists Ai such that for all A < Ai we 
have F(A) < ^. Second we have for all m > 1 



m) < P( Ji + . . . + J m < A) = / 

J o 

where for all x £ [0, A] 

_*m— „.m— 1 



A 

TQ^T* m - 1 {x)dx, 



T *T* m - i ( X )=X m - 1 T (xtx) T(xt 2 )... 

Jo Jo 

1— 1\ — t m — 2 

r(xt fn _i)r(j;(l - ti - ... - i m _ 2 - i m -l))^i • • • c^m-i- 



o 
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We derive 



< x m - x sup r Q (i)( sup r(t))" 1 " 1 / / . . . / dtl . . . dtm-l 



te[o,a;] te[o,x 

1 . ry-m— 1 

^(supr^))™' 1 



/Ae[o,A] (m-1)! 
since 

1 /•!— tl /"I— tl— —— tra-2 



c?ii . . . (it 



o io " 'Jo 1 

1 1 A m 

A = m)<-(supr(t)) m - 1 — . (52) 
M te[o,A] m! 

Since r is continuous, there exists A2 such that 

sup r(t) < 2r(0). 

tG[0,A 2 ] 

Taking Ao = Ai A A2, (51) and (52) lead to the second assertion. The first one 
is straightforward from the previous computations. 



It follows that 



Proof of Proposition 3 

According to the definition of La,k inequality (20) is immediate. The de- 
pendency in t(0) and Tl of the constant is a consequence of Lemma 1, part 
2) of Proposition 2 and Lemma 2. A rearrangement of the terms enables to 
write La,k as a sum of increasing powers of Pa[/]*™. Thus we have to prove 
that only the K + 1 first convolution powers of Pa[/] intervene and that the 
coefficient Z m (A) in front of PaI/]*" 1 in the rearrangement satisfies 

|UA)| <£ T (o)A m - 1 . 

For that we show that for all L > 1 the Taylor expansion of order L in 
A of H^[Pa[/]], that we denote 1*a,k,l, only depends on Pa[/]*"\ tu = 
1, . . . ,L + 1 with coefficients such that Z mj x(A) < (£ T (o)A m_1 . We prove the 
result by induction on K. For K = 1 we immediately have the result by 
Lemma 1 since 

00 

H A ,/[Pa[/]] = 2P A [/] " X>m(A)PA[/r, 

m=l 
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it follows that 



L+l 

La,l,i = (2-pi(A)P A [/] - ^p m (A)P A [/r 



m=2 

, m— 1 



with /i,i(A) = (2-pi(A)) < 2 and Z m ,i(A) = p m {A) < f T ( )A m_1 . Then using 
the definition of Ha,/ we have 



H^ +1) [P A [/]] =Pa[/]+H25[P a [/]] -£p m (A)(H25[P A [/] 



r o(K+l) | 

1 A[J]j -'A[J]T "A,/ I 1 A LJ J J - 

m=l 

The induction hypothesis and Lemma 1, with part 2) of Proposition 2 which 
ensures that H^j [P A [/]] G W(s,7r,0,9t), lead to 

L+l 

La,l,x+i = Pa[/] + l a,l,x - 2^ Pm(A) \La,l,k) 

m=l 

L+l L+l L+l 

= Pa[/] l^l^AjPAir - $> m (A)( ^ L',l(A)P a [/] W 

m=l m=l m'=l 

L+l _ 

= ^Z m>i+1 (A)P A [/]* m , 

m=l 

where Zi i l + i(A) = 1 and 

m 

r m , L+ i(A) =7 m , L (A)-^p fc (A) ^ r ni>L (A)...7 nfciL (A) 

fe=l ni+...+n fc =m 

which we bound with Lemma 1 and the induction hypothesis by 

m 

|T m>L+ i(A)| < C^A^ + J^A*- 1 ^ A" 1-1 . . . A"* -1 

fc=l ni + ...+n fe =m 

= t( A'"" 1 + mA™- 1 ) < CA"- 1 , 



where £ is a positive constant depending on r(0) and K. We conclude the proof 
having L = K and l m (A) = l m ,K for m = 1, . . . , K + 1. 
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Proof of Lemma 8 



Preliminary 

Lemma 9. Work under assumptions 4 and 3, for all r > 2 

E[|^-i?| r ] < {(T 1 ^ + T~ r / 2 ), 
where <£ depends on r, 21, a, i? and $t is defined in Definition 2. 

Proof. Let r > 2, the proof is a consequence of Proposition 5.5 of Dedecker et 
al. [9] which is a Rosenthal type inequality for dependent data. Define 

[TA _1 J 

S T = Yl 
1=1 

where So = Xq = and the lj = 1d a x 4 ^o ~~ are centered identically 
distributed random variables bounded by 1. To apply Proposition 5.5 of [9] we 
have to verify that (Yi) is a sequence of #i j00 — dependent random variables. For 
that Proposition 2.3 of [9] ensures that it is sufficient to have a dependent 
sequence which is defined as follows with notation of [9]; Let T(u,v, k) be the 
set of (i,j) in Z u x Z v such that 

h < . . . < i u < i u + k < ji < . . . < j v , 

we have to show that for all / G T u the set of bounded function from M. u to 
R and for all g G Q v the set of Lipschitz function from W to R with Lipschitz 
coefficient denoted Lipg the sequence 6(k) defined as 



9(k) = sup sup sup 



\Cov(f(Y il ,...,Y iu ),g(Y jl ,...,Y jv ))\ 



u.v 



(i,i)er(«,D,fc)/eJ c " ll ,(?eg t , w ll/l|ooLip5' 



tends to 0. We denote as Yi and Yj respectively (Y^ , . . . , Yi u ) and (Yj 1 , . . . , Yj v ), 
and due to the fact that X is a renewal process Yi and Yj are independent if 
there exists r such that i u < r < j\ and Y r = 1 — p(A) i.e there is a jump 
between Yi u and Yj ± . We denote by A the event "there exists r such that 
i u < r < ji and Y r = 1 — p(A)v It follows that 

[CoviftX&giXj))] = \E[[f(Xi) - E[/(Xj)]) (g^Xj) - g{Qj))t{ A }] \ 
<2||/|| 00 Li P5 E[||X j ||l {A} ] 
< 2 W ||/|| 00 Lip( 7 P(i? fc A + 0), 



39 



since \\Xj\\ < v as the Yi are bounded by 1, for every L p normp > 0, and E[l{^}] 
is bounded by F(R k% / 0). We immediately derive that 6(k) < 2F(R kA ^ 0) 
and by Assumption 4 we derive 

9{k) < £exp(-a(£;A) '), (53) 

where g < q' , it tends to 0. We verify the hypothesis of Proposition 5.5 of [9] 
and get for all r > 2 

E[\Sr\ r ] ^^(LTA- 1 ] £ ^- 2 ^) + (LTA- 1 j £ ^)) r/2 ) 

j=l i=l 

where £ depends on r. Since we have the upper bound (53), we derive applying 
(40) with k = and k = r - 2 

ITA- 1 ] ITA- 1 ] 
J3 < £A and Yj i r ~ 2 0{i)Q{i) < 

i=l i=l 

where <£ depends on 21, a, $. It follows 

I ^^ I7 E[|5 T r]<c(T 1 - + r-/ 2 ), 

where £ depends on r, 21, o, The case r = 2 is a consequence of 

E 0^l 2 ] = ^^v(y 1 ) + -^ £ CooC*,^) 

l<i<i<T 

and the upper bounds V(Yj.) < £A where £ depends on i? and 

ICo^Fi+fc)! <£exp(-afcA). 

We derive 

]2^i]a E 0«rl a ]=Cr- 1 

where <£ depends on 21, a, #. We conclude the proof using Assumption 3, for all 
r > 2 

E[|tf? - = E[\q-\qfy) - q- 1 (qm r ] 

< Wq- 1 ^ E[\q(fr) - q(#)\ r ] < £(T 1 ~ r + T^ 2 ), 

where £ depends on r, 21, a, □ 
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Completion of the proof of Lemma 8 

The remaining of the proof is now based on the fact that under Assumption 
3 the functions i? — > p m {.,'Q) are Lipschitz continuous. We show that their 
derivative with respect to $ is bounded, we have for m > 1 that 

0*kn(A,0)] = A j nN ( I dfa{.,V)*T^\,m*)dz 

J T2{Z,V)dz JO 



Jo T 2 (z,$)dz J 

T 2 (x,$)*Tl m (.,$)(z)dz^J (54) 

where T2(., l &)/fJ, is the density of Ji. Immediate induction gives for m > 1 

a*fa(.,0)*Tr(., <>)](*) =^[r 2 (.^)]*rr(.^)(^) 

+ mr 2 (, i?) * ^[n (., 0)] * r{ m ~ l (. , 0) (z) (55) 

and 

/ g* m (x)dx < CA™- 1 (56) 

JO 

for some constant <£ and any bounded function g supported by (0, oo). Moreover 
we have 

00[t 2 (z,0)] = - / d#Ti(x,$)dx, 



and it follows from Assumption 3 that for A small enough we have Vz < A 

< < n(z,$) < 2X1(0,1?), (57) 

and that its derivative is bounded over [0, A]. Finally we bound (54), using 
(55) (56) and (57), we get 

|^[p m (A,tf)]| <£A m ~\ 
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where <£ continuously depends on $. Then taking the supremum in •& over the 
compact set we derive 

|^[p m (A,tf)]| <€A m -\ 

where £ is a positive constant independent of i9. It follows that for m > 1, the 
functions — > p m (-, "&) are Lipschitz continuous and with Lemma 9 we derive 

E[\ Pm (A,^)-p m (A,$)\ r ] KCA^Ell&r-^) 

< ^(T 1 - 7 ' + T- r / 2 ), 

where <£ is a positive constant depending on r, 21, o, i9. We conclude the proof 
using that l m (A,$) = Z(pi(A, i?), . . . ,p m (A, where / is Lipschitz in every 
argument and the argument are bounded by 1. 
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